Bigtable Merge Compaction

Authors

  • Claire Mathieu
  • Carl Staelin
  • Neal E. Young
  • Arman Yousefi
Abstract

We initiate the formal study of the online stack-compaction policies used by big-data NoSQL databases such as Google Bigtable, Hadoop HBase, and Apache Cassandra. We propose a deterministic policy, show that it is optimally competitive, benchmark it against Bigtable’s default policy, and suggest five interesting open problems.

Similar resources

K-Slot SSTable Stack Compaction

We initiate the formal study of the online stack-compaction policies used by big-data NoSQL databases such as Google Bigtable, Hadoop HBase, and Apache Cassandra. We propose a deterministic policy, show that it is optimally competitive, benchmark it against Bigtable’s default policy, and suggest five interesting open problems.

Lightweight Indexing for Log-Structured Key-Value Stores

The recent shift towards write-intensive workload on big data (e.g., financial trading, social user-generated data streams) has pushed the proliferation of log-structured key-value stores, represented by Google’s BigTable [1], Apache HBase [2] and Cassandra [3]. While providing key-based data access with a Put/Get interface, these key-value stores do not support value-based access methods, which...

Compaction Management in Distributed Key-Value Datastores

Compactions are a vital maintenance mechanism used by datastores based on the log-structured merge-tree to counter the continuous buildup of data files under update-intensive workloads. While compactions help keep read latencies in check over the long run, this comes at the cost of significantly degraded read performance over the course of the compaction itself. In this paper, we offer an in-de...

Write-Optimized Indexing for Log-Structured Key-Value Stores

The recent shift towards write-intensive workload on big data (e.g., financial trading, social user-generated data streams) has pushed the proliferation of log-structured key-value stores, represented by Google’s BigTable, HBase and Cassandra; these systems optimize write performance by adopting a log-structured merge design. While providing key-based access methods based on a Put/Get interf...

Scaling to Build the Consolidated Audit Trail: A Financial Services Application of Google Cloud Bigtable

Google Cloud Bigtable is a fully managed, high-performance, extremely scalable NoSQL database service offered through the industry-standard, open-source Apache HBase API, powered by Bigtable. The Consolidated Audit Trail (CAT) is a massive, government-mandated database that will track every equities and options market event in the US financial industry over a six-year period. We consider Google...

Publication date: 2014